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Abstract — We propose a new class of games, called Multi- 
Games (MG), in which a given number of players play a fixed 
number of basic games simultaneously. Each player can have 
different sets of strategies for the different basic games. In each 
round of the MG, each player will have a specific set of weights, 
one for each basic game, which add up to one and represent 
the fraction of the player's investment in each basic game. The 
total payoff for each player is then the convex combination, with 
the corresponding weights, of the payoffs it obtains in the basic 
games. The basic games in a MG can be regarded as different 
environments for the players, and, in particular, we submit that 
MG can be used to model investment in a global economy with 
different national or continental markets. When the players' 
weights for the different games in MG are private information or 
types with given conditional probability distributions, we obtain 
a particular class of Bayesian games. We show that for the class 
of so-called completely pure regular Double Game (DG) with 
finite sets of types, the Nash equilibria (NE) of the basic games 
can be used to compute a Bayesian Nash equilibrium of the 
DG in linear time with respect to the number of types of the 
players. We study a DG for the Prisoner's Dilemma (PD) by 
extending the PD with a second so-called Social Game (SG), 
generalising the notion of altruistic extension of a game in which 
players have different altruistic levels (or social coefficients). We 
show that, with respect to the SG we choose, the payoffs for PD 
give rise to two different types of DG's. We study two different 
examples of Bayesian games in this context in which the social 
coefficients have a finite set of values and each player only knows 
the probability distribution of the opponent's social coefficient. 
In the first case we have a completely pure regular DG for which 
we deduce a Bayesian NE. Finally, we use the second example 
to compare various strategies in a round-robin tournament of 
the DG for PD, in which the players can change their social 
coefficients incrementally from one round to the next. 

I. Introduction 

Game theory was originally introduced to model the be- 
haviour of rational agents in which players make independent 
decisions in order to maximise their utility or payoff in 
an economy fl]. The notion of Nash equilibrium (NE) has 
become the key concept in game theory since John Nash's 
celebrated proof of existence of a mixed Nash equilibrium for 
all finite games in 1950 Q. A similar notion of Bayesian NE 
is also at the basis of games with incomplete information as 
shown by Harsanyi in 1960's |3|. 

However, such an equilibrium strategy for players can only 
be useful for determining or predicting economic behaviour if 



it can be efficiently computed, whereas it has become clear 
in a number of papers that computation of a NE or even 
an approximate e-Nash equilibrium is in general a compu- 
tationally hard problem [41, |5|. Clearly, if the computation 
of a NE is unfeasible because of its high complexity, then its 
existence, despite having theoretical significance, has no value 
in practice. It is thus useful to have models in game theory 
for which the computation of a Nash equilibrium can be more 
efficiently done than in general. 

In this paper, we propose a class of games called Multi- 
Games (MG) which can be used to model economic behaviour 
when each player can allocate its resources in varying pro- 
portions to play in a number of different environments, each 
representing a basic game in its own right. Each player can 
have different sets of strategies for the different basic games. 
The payoff of each player in a MG is assumed to be the convex 
linear combination of payoffs obtained for the basic games 
weighted by the allocated proportions to them. Here we first 
present a simple example of a Double Game (DG) with two 
basic games. 

Consider two multinational companies which can invest in 
the national economies of two different countries Ri and R2 
with different cost of investment, rates of profit, labour value, 
interest rates etc. Suppose they need to decide in what ratio to 
divide their assets for investment in the two countries and, in 
addition, whether to enter into a particular venture or simply 
deposit their allocated assets in some bank in each country. We 
thus have two games, Gi for Ri and G2 for i?2, one for each 
national economy, each with two players and two strategies for 
entering (E) and not entering (N). Let nij denote the payoff 
function in Gi for player j (with i,j — 1,2). 

Suppose the first player invests A and 1 — A fractions of its 
assets in Ri and R2 respectively, and assume 7 and 1 — 7 are 
the corresponding fractions for the second player Then, the 
payoff to the first player for the strategy profiles {Xi,Yi) in 
Gi and {X2,Y2) in G2, with X^,Y, e {E,N} for i = 1,2, 
would be 

Xnii{Xi,Yi) + (1 - A)7r2i(X2,y2), 
whereas the payoff for the second player for the same strategy 



profile would be 

7^12(Xi,yi) + (1 -7)7r22(^2,>2). 

Now if the coefficients A and 7 are both private information 
and can each take only a finite number of values between zero 
and one, then the DG is reduced to a Bayesian game with a 
finite set of types for the two players and we can look for a 
Bayesian Nash equilibrium. The values and 1 will always 
be included in the set of possible types of each player and are 
called the extreme types. 

In the next section, we will define MG in general and then, 
for convenience and ease of presentation, we restrict ourselves 
to the class of 2-player DG in which each player has the same 
set of strategies in the two basic games. We later define the 
class of pure regular DG in which for the pairs of extreme 
types there are four pure NE in which the strategy of each 
player only depends on its own type. Similarly, for a DG with 
k and ^ types for the two players respectively, we define the 
notion of a completely pure regular DG where there are fc x ^ 
pure NE for all possible pairs of types of the two players in 
which the strategy of each player only depends on its own type. 
We then derive a test for establishing that a DG is completely 
pure regular in linear time with respect to the number of types 
of the players and show that a pure Bayesian equilibrium for 
completely pure regular DG can be obtained directly from this 
test, thus reducing the complexity of computation. 

We will then apply this framework to obtain a double game 
extension of the Prisoner's Dilemma (PD) to model prosocial 
behaviour In this DG for PD, the first game is the classical 
PD and the second game captures the social or moral gain for 
cooperation for each player We now review the concept of 
prosocial behaviour and moral gain in PD. 

The PD is considered a standard method for modelling 
social dilemmas ||6l, JT). In the 1980's, Axelrod organised 
two international round-robin tournaments in which strategies 
for the repeated PD competed with each other [8|, [9|. In 
the competition, tit-for-tat, i.e., cooperate on the first move 
and then reciprocate the opponent's last move, proved to be 
robust and became the overall winner of the tournaments |]8], 
Jg). Axelrod then promoted tit-for-tat, and the four associated 
characteristics of (i) be nice, (ii) reciprocate, (iii) don't be 
envious, (iv) don't be too clever, as the way reciprocal altruism 
has evolved ifTOl . The PD has also been used to model 
conditional altruistic cooperation, which has also been tested 
by real monetary payoffs [11]. 

However, when confronted with the choice to cooperate or 
defect, human beings not only consider their material score, 
but also the social and moral payoffs of any decision they 
make. This means that the material payoffs presented in the 
PD cannot provide a complete picture of the decision making 
process human beings follow. In fact, according to some re- 
searchers, human social evolution has a moral direction which 
has extended our moral campus in the course of thousands of 
years of increasing communication and complexity in human 
social organisation lH2l . ifTSl . Moreover, there are individual 
and temporal variations in pro-social attitudes of human beings 



with some making decisions more based on self-interest than 
others. A more adequate model of human behaviour should 
take into account these aspects of social evolution as well. 
The same applies to economic decisions by corporations or 
governments, in which actions taken can have significant social 
and environmental implications, which are not incorporated in 
the material gains they produce. In lfT4l . it was proposed that a 
coefficient of morality be introduced to the PD and the payoff 
values of the players be accordingly changed. The so-called 
altruistic extension of any finite strategic game was defined 
in [Tsl, which endows each player with an altruistic level in 
the unit interval which provides the weight of the pro-social 
attitude of the player This modification aims to reflect real- 
life situations and dilemmas more accurately by taking into 
account both material and moral/social gains. Thus, for each 
player, the payoff is a weighted, linear combination of the 
payoffs for the PD and the SG. In essence, it is a convex 
combination of the payoffs occurring from both material and 
social dilemmas. 

In this paper, we show that the DG, as an instance of MG, 
provides a generalisation of the altruistic extension in [151 
which can be considered as a DG with the first game identified 
as the original game and the second game as a symmetric 
altruistic game. In a general DG, the social or altruistic game 
is allowed to be non-symmetric, which means that in general 
the altruistic payoffs for the different players may be different 
even for the same strategy profile. 

We furthermore consider the DG for the PD where the social 
(altruistic) coefficient of each player forms a finite discrete 
set of incomplete information or types thus giving rise to a 
Bayesian game. We prove that this DG is in fact pure regular 
and determine its Bayesian equilibrium when it is completely 
pure regular. 

II. MuLTi Games 

The purpose of using this model is to add a new dimension 
to the description of a range of situations, achieved through 
the employment of game theoretic models. This is done by 
linearly combining the payoff matrices of various games and 
linking them through the use of a coefficient for each player, 
which represents the amount of investment that a player is 
willing to commit in that particular game. More specifically 
MG is defined as follows. Consider M finite A^-player games 
Gi{l < i < M) with the strategy set Sij and payoff matrix Tr^ 
for player j (1 < j < N) in the game G, . Assume each player 
j is equipped with a set of M weights with X]j=i ^ij — 1- 
We define the MG N-player game G with basic games Gi as 
the finite strategy game with players j (1 < j < N) each 
having strategy set ni<i<A/ ^ij ^"'^ payoff 

n ^ X! ^ij'^iji^^o) 

l<i<M l<i<M 

for strategy profile Sij G 5*^ and possibly incomplete in- 
formation (types) Xij for 1 < i < M. We say the MG is 
uniform if for each player j, the set Sij is independent of the 



game Gi, i.e., we can write Sij — Sj, and Sij — Si/j for 
1 < i < i' < M. 

Harsanyi il6j suggested a method to solve games with 
incomplete information by transforming it to a game with 
imperfect information, in which a probability distribution 
for each unknown value, referred to as a type, is provided. 
Such a game is called a Bayesian game and a Bayesian 
Nash equilibrium is then a Nash equilibrium in a Bayesian 
game [171 . 

In this extended abstract we restrict ourselves to uniform 
MG with two players N — 2. We now give a simple example 
in this case . Assume that the strategy set for each player 
consists of actions C and D and denote the weights for players 
1 and 2 respectively by Ay and 7^ , with 1 < i < M and 
j = 1, 2. If the payoff matrix for the basic game Gi is given 
as in TABLE m then the payoff matrix for the MG G will be 
given as in TABLE HH 



Player 2 
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D 
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(ctil, ai2) 


{bii,bi2) 


D 


(Cil, Ci2) 


{dii,di2) 



TABLE I; Payoff matrix of the basic games Gi 
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D 


c 






D 


Ei=l ^iCii, 7iCi2 


Y^fL\ ^idil, Efcl 7i<^i2 



TABLE II: Payoff matrix of the MG 



A. Coherent pairs of NE 

For finite sets of types, we assume that the weights Ay for 
each player j are selected from a finite discrete set, which 
would denote the types of each player. When these values are 
private information we have a Bayesian game. 

For simplicity in this paper, we suppose our MG is uniform 
with M = 2 and N = 2; our results however do extend to 
the more general case. Since we now have two basic games 
Gi and G2 and two players, we can denote the weights of the 
first player by 1 — A and A, and those of the second player 
by 1 — 7 and 7. This means that for A = the first player 
invests totally in the first game Gi whereas when A = 1, the 
first player invests totally in the second game G2. Similarly 
for the second player with weight 7. 

Thus, in the finite discrete case, the finite set of types of 
each player is given by a set of increasing values, say Am 
(1 < TO < fc) for A and a set of increasing values, say 7„ 
(1 < n < £) for 7, where each type is restricted to the unit 
interval. We assume in the discrete case that we always have 
and 1 as types for each player, i.e., Ai = 71 = and A^ = 



7f = 1, which we call the extreme types. We let G'^^'^^ denote 
the DG game G with the types taking the specific values A 
and 7. In the discrete case, we in addition let G'"" denote the 
DG game G with the types A™ and 7„ selected for the two 
players respectively. We refer to a NE for G^^''^^ as a local 
NE for the DG G. 

Assume we have a uniform two-player DG G with basic 
games Gi and G2. Given a player j, we denote as usual the 
strategy set of the opponent of j by S-j. 

Definition 1. The DG G has a coherent pair q/pure NE /or 
a player j with a given type if there is an action s G Sj and 
actions u,v G S-j such that (s, u) and (s, v) are respectively 
pure NE for G with the given type for player j and the two 
extreme types of the other player In this case, the pair of 
profiles {{s,u), {s,v)) is called the coherent pair of pure NE 
for player j with the given type. 

For example, suppose we fix a type A,,, with 1 < m < k 
for the first player, then the DG has a coherent pair of pure 
strategies for the first player with type A„i if there are actions 
s G 5*1 and u,v G S2 such that {s,u) is a pure NE for G™^ 
and (s, v) is a pure NE for G™^. 

Proposition 2. If the DG has a coherent pair ((s, u), {s,v)) 
of pure NE for the first player with type Am, then there exists 
an integer p with 1 < p < £ such that G™" has (s, u) as 
a pure NE for I < n < p and has (s, v) as a pure NE for 
p < n < i. 

There is also a version of the above result for mixed NE, 
which holds for the general case of continuous types A for 
player 1 and 7 for player 2. We denote the set of mixed 
strategies over the strategy set S by A4{S). 

Definition 3. The DG G has a coherent pair of mixed NE 
for a player j with a given type if there is a mixed strategy 
(7 G Ad{Si) and mixed strategies CTq, <Ji G M.{S-j) such that 
(cr, CTq) and (cr, ai) are respectively mixed NE for G with the 
given type for player j and the two extreme types of the other 
player In this case, the pair of profiles {{a,<Tif)^{a,(7i)) is 
called the coherent pair of mixed NE for player j with the 
given type. 

Thus, player one will have a coherent pair of mixed NE if 
there exists a G M{Si) and cto, fi G ^^(5*2) such that (cr, ctq) 
is mixed NE for G'^'" and (cr, cri) is mixed NE for G^'^ . 

Proposition 4. If the DG with continuous types A and 7 has 
a coherent pair of mixed NE (ct, CTq) and (a, ai) for the first 
player with a given type X, then for each 7 in the unit interval 
there exists a mixed strategy profile of the form (cr, cr^) that is 
a NEforG^^'^l 

Proof: For simplicity we present the proof for the case 
that each player has a strategy set with two actions: Si = 
{Gi,Di} and 5*2 = {G2, D2}, with the payoff matrices G(^'°) 
and G(^'i) given in TABLE HU] and TABLE |Tyl 

Suppose cr pCi + {l~p)Di and ~ P7G2, +{l—Pf)D2, 
where for 7 = and 7 = 1 we obtain the pair of coherent 



Player 2 





C 


D 


c 


(ai, 02) 




D 


(Cl, C2) 


(di,d2) 



TABLE III: PD - Payoff Mattix Representation 



Player 2 





C 


D 


c 


(ei, 62) 


(/l,/2) 


D 


(91,52) 


{hi,h2) 



TABLE IV: PD - Payoff Matrix Representation 

mixed NE (cr, cto) and (ct, cri). Then a long calculation shows 
that (CT,cr^) is a mixed NE for G^^'^'> if 

(l-7)potp(a-2-62) + (l-p)(c2-rf2)]+7Pl[p(e2-/2) + (l-p)(g2-/t2)] 
(l-7)b(a-2-b2) + (l-p)(c2-d2)l+7[p(e2-/2) + (l-p)(S2-/i2)] ' 

The general case can be proved in a similar way. 

■ 

B. Pure regular DG 

Next, we examine how information about the set of local 
pure NE for the DG for various types of the two players can 
be used to deduce the Bayesian NE for the DG. We say a 
DG is pure regular if it has a set of four pairs of pure NE for 
all extreme types, for which the strategy of each player only 
depends on its own type. Here is the exact definition. 

Definition 5. We say a DG for two players is pure regular ;/ 

there are four pure strategy profiles (s, u), (s, v), (t, u), (t, v) 
such that the two pairs ((s, m), (s, u)) and {(t,u), (t,v)) are 
respectively coherent pairs of pure NE for the first player 
with extreme types A = and A = 1 respectively, while the 
two pairs {{s,u), (t,u)) and {{s,v), {t,v)) are respectively 
coherent pairs of pure NE for the second player with extreme 
types 7 = and 7 = 1 respectively. We say that the four 
strategy profiles induce pure regularity. 

For a DG with a finite set of types for each player, we can 
go further as follows. 

Definition 6. We say a DG with finite sets of types given by 
Am (1 < m < k) and 7n fl < ri < £j is completely pure 
regular if there are pure strategies Sm € Si (1 < m < k) and 
Un E S2 (1 < n < i) such that the strategy profile {sm,Un) 
is a pure Nash equilibrium for the game G""" for 1 < n < i 
and 1 < m < k. 

It is clear that a completely pure regular DG is 
pure regular and thus our terminology is consistent. Note 
also that for a completely pure regular DG as above 



(n i<m<fc ni<Ti<f ^n) ^ P™^ Bayesian strategy in 
which the first player takes action Sm for type A,„ and the 
second player takes action ii„ for type 7„ . The above notions 
can also be extended to mixed regular and completely mixed 
regular DG. 

Lemma 7. The DG is completely pure regular if and only if for 
all conditional probability distributions for the types of the two 
players the Bayesian pure strategy (ni<m<fc ^m, ni<n<f "«) 
is a pure Bayesian NE. 

We can determine if a DG with finite types for the two 
players is completely pure regular as follows: 

(i) Test if the DG is pure regular for a set of four pure NE 
with extreme types. 

(ii) For each set of four pure NE, which induces pure 
regularity, use Proposition |2] to determine all the pure NE 
for all fc X ^ combinations of pairs of types for the two 
players; these will be the pure NE of G™" for 1 < m < k 
and 1 < n < ^. 

(iii) Finally check whether the set of fc x ^ pure NE induces 
a completely pure regular DG. 

These three tasks (i) - (iii) can be done in linear time (i.e., 
linear in the maximum number of types max(fc, £) for the two 
players). We therefore have the following theorem. 

Tlieorem 8. Given any DG with finite number of types for the 
two players, we can decide in linear time if it is completely 
pure regular, in which case a Bayesian pure NE is obtained 
in linear time. 

The above theorem can be extended to any A^-player MG 
with M basic games and finite number of types for each 
player A generalisation to completely mixed regular DG is 
also possible. In the next section, we present a pure regular 
DG with a finite number of types which will give us both an 
example of a completely pure regular and an example where 
this property fails. 

III. A DG FOR Prisoner's Dilemma 

The PD is a fundamental non-zero-sum problem of game 
theory that attempts to mathematically analyse the behaviour 
of individuals in a strategic situation, in which the success of 
each individual does not depend entirely on one's choice, but 
on the opponent's as well ifTol . Essentially, it is an abstract 
formulation of some common situations in which what is 
best for each person individually leads to mutual defection, 
whereas everyone would have been better off with mutual 
cooperation flOl. It has provided a tool for experimental 
studies in various disciplines, such as economics, social psy- 
chology, evolutionary biology and fields that are involved with 
modelling of social processes, such as behaviour in decision 
making [10| |18|. 

Each of the players competing in the PD has the choice 
to cooperate (C) or defect (D) and the payoff values gained 
by the combination of the aforementioned actions are T, R, P 
and S. 

The payoff matrix of the PD is presented in TABLE |V] 
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Player 1 
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(R, R) 


(S,T) 
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(T, 5) 





TABLE V: PD - Payoff Matrix Representation 
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Player 1 
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(Ml, Ma) 


(Mi.Mj) 


D 


(m;,m2) 


(m;,m;) 



TABLE VL SG - Payoff Matrix Representation 



The values of T, R, P and S satisfy the following two 
inequahties: 

T>R>P>S and R>{T + S)/2 

The first equation specifies the order of the payoffs and 
defines the dilemma, since the best a player can do is get 
T (i.e. the temptation to defect payoff when the other player 
cooperates), the worst a player can do is get S (i.e. the sucker's 
payoff for cooperating while the other player defects), and, in 
ordering the other two outcomes, R (i.e. the reward payoff 
for mutual cooperation), is assumed to be better than P (i.e. 
the punishment payoff for mutual defection) 110|. The second 
equation ensures that, in the repeated game, the players cannot 
get out of the dilemma by taking turns in exploiting each 
other. This means that an even chance of exploiting and being 
exploited is not as good an outcome for a player as mutual 
cooperation. Therefore, it is assumed that R is greater than 
the average of T and S IdOJ . Finally, a special case of the 
PD occurs when the apparent advantage of defecting over 
cooperating is not dependent on the opponent's choice and 
the disadvantage of the opponent defecting over cooperating 
is not dependent on one's choice, as can be illustrated in the 
following equation ifTol : 

T + S = P + R or R + P -T - S = 

A. The Social Game 

The SG encourages cooperation and discourages defection, 
as cooperating is usually considered to be the ethical and 
moral choice to make when interacting with others in social 
dilemmas. This can be done in different ways corresponding 
to different types of payoff matrices. Here, we will restrict to 
the case that the SG encourages cooperation and discourages 
defection for each player, independently of the action chosen 
by the other player 

We present the normal form and the mathematical for- 
mulation of the SG as follows. Assume that the competing 
participants in the SG are player 1 and player 2. Each of them 
has the choice to select between C and D. When they have both 
made their choice, the payoffs assigned to them are calculated 
according to payoff matrix |Vl] where Mi, M2 and M^, A/2 
satisfy: 

Ml > M'l, M2 > M2 

When All — M2 and M[ = M2, we will have a symmetric 
social game and our framework reduces to the altruistic 
extension in fTS'l. 



Thus, in the SG we treat in this paper, the players are 
individually and independently rewarded for cooperating and 
punished for defecting. This can be interpreted in the following 
way. Cooperation by an individual, independent of the action 
of the opponent, is socially rewarded by inducing a good 
conscience, whereas defection is punished by creating a guilty 
one. The values of Mi, M2 and M^, are assumed to be 
socially determined to correspond to the average moral norm 
in the given society and are considered to have evolved in the 
course of increasing complexity, communication and moral 
growth in human history. 

B. The Double Game 

Although the payoffs for the SG are determined by the social 
context of the game, there is still individual variation in pro- 
social behaviour of the players. We assume each player has 
a social coefficient taking values between and 1, which 
reflects how pro-social they are in practice in each round of 
the game. In our particular SG, the social coefficient of a 
player signifies how much the player cares about the morality 
or the social aspect of their action. The payoffs of the DG for 
each player are then the weighted sum or convex combination 
of the payoffs of the PD and SG using the player's social 
coefficient as represented in TABLE IVIII where A and 7 (with 
< A, 7 < 1) are the social coefficients of players 1 and 
2, respectively. Note that the two players can still play the 
standard version of the PD by selecting their social coefficients 
to be equal to 0, in which case the DG reduces to the PD. 





C 


D 


c 


(1 - A)_R + AMi, (1 - 7)R + 


(1 - A)5 + AMi, (1 - 7)T + 7S 


D 


(1 - A)T + AS, (1 - 7)5 + 7M2 


(1 - A)P + A5, (1 - 7)P + 75 



TABLE VII: DG - Payoff Matrix Representation 

In addition to the inequalities satisfied in the payoffs for 
PD and SG, we stipulate the two new inequalities below that 
connect the payoff values from both the PD and the SG: 

Ml, A/2 > {R + P)/2 and T> R> Mi > M2 > P > S 
or T> R> A/2 >Mi>P>S 

First, we argue that AIi and A/2 should be less than T, 
but greater than P. The former should hold, otherwise if Mi 
and A/2 are equal to or greater than T, then, there is no 
dilemma as to what the best strategy is (one should select 



the highest possible social coefficient and always choose C 
in order to achieve the highest available payoff), and, the 
SG loses its meaning. On the other hand, the latter should 
hold, because, if Mi and M2 are equal to or less than P, 
then, cooperation is discouraged, since one would have no 
incentive to select a high social coefficient and choose C. In 
addition, A/i and M2 should be strictly less than R, as we 
would like to encourage cooperation in the SG by assigning 
to it a payoff value that is somewhat less than the payoff 
value obtained through mutual cooperation in the PD. This, 
we believe, reflects more accurately real-life situations, as, 
in general, the decisions based on moral incentives do not 
bring high material benefits. Finally, we assume that Mi and 
M2 should be greater than the average of R and P, so that 
the dilemma of whether to cooperate or defect becomes more 
intense. 

Then, we argue that Mi and A/j should be equal to 5", so as 
to discourage defection with a high social coefficient, which 
would be self-contradictory, and, also, to punish, in a sense, 
defection, since Mi, M2 are the payoff value for defection in 
the SG, which, by its definition, should not give a high value 
to defection. 

The selection of the social coefficient reveals, in part, the 
strategy one will follow in a given game. To illustrate this with 
an example, note that the choice of social coefficient equal to 
1 implies cooperation, since defection would give a payoff 
of 0, and, similarly, the choice of social coefficient equal to 
most probably implies defection, since cooperation in that case 
would give a payoff of 0, unless it is mutual, in which case 
it would be beneficial. On the other hand, selecting a social 
coefficient between and 1 leaves room for more complex and 
sophisticated strategies. Finally, as we will see later on, in the 
implementation of the DG, certain restrictions are imposed 
on how much a player can increase or decrease the social 
coefficient in a single round. This is done, since, in general, 
humans do not change their moral values radically in a short 
amount of time. 

C. Double game with complete information 

If we assume that the players know each other's social 
coefficients prior to every game, the DG becomes a game 
with complete information and all payoffs are known to both 
players. In this section we focus on the analysis of the Nash 
equilibrium in this type of the DG. It is well known that the 
Nash equilibrium for the PD is mutual defection, represented 
by (D,D). However, from the perspective of the SG, the best 
response of any player is to cooperate, as this always leads 
to a better score as compared to defecting. As a result, if 
the players make their decisions with no concern for their 
opponents' behaviour, it leads to a Nash equilibrium of mutual 
cooperation, represented by (C,C). However, this simplicity 
cannot be incorporated in the DG, due to the inclusion of the 
social coefficient, which alters the reward for all outcomes. 

In accordance with the equilibrium, we must find out how 
the social coefficients of the two players alter the potential 
payoffs for the four possible outcomes of the game. The 



payoffs for each possible outcome change along with the 
variation in the social coefficients A and 7 of players 1 and 2, 
respectively, as shown in TABLE IVIIII 

At this stage, we consider the payoff equations for player 1 . 
We note that, by symmetry, a similar analysis can be conducted 
for player 2. Furthermore, without loss of generality, we 
assume the equality Mi = M2 = S, since the SG punishes 
defection. 

For A let us label the three crossing points of the payoff 
equations as X — ai for 7ri(D,D) = 7ri(C,D), A = 61 for 
7ri(D,C) = 7ri(C,C) and X ^ ci for 7ri(D,C) = 7ri(C,D). By 
equating the equations for each payoff, we find the values of 
the crossing points (similarly for 7) to be: 



P-S 



P-S 



ai 



02 = 



M1+P-2S 

- T-S + Mi^R' 
T-S 

Cl = ,r . rr. ^2 = 



M2 + P-2S 
T - R 



T-S + AI2-R 
T-S 



M1+T-2S' 



M2 +T-2S 



(1) 



(2) 



(3) 



We have the three following cases: 

> ai < &i < ci , 02 < 62 < C2 if P ~ S <T - R 
« 5i < ai < ci , 62 < 02 < C2 if P - S > T - R 
• ai — bi < ci , a2 ~ b2 < C2 if P — S — T — R 
We can obtain the Nash equilibrium points for Mi, M2 > R 
and Ml, M2 < i? and Mi = M2 = R; note that for all values 
of A and 7 that are greater than 61 or 62, the equilibria are 
equal. To illustrate the method, we will compute below the 
Nash equilibrium for the two generic cases of ai < 61, 02 < 
62 and 61 < ai, 62 < a2 when T > R > Mi > M2 > P > 
M[ = M^ = S. 

1) Case: ai < hi and 02 < b2: FIGURE [T] shows the 
variation in the payoffs resulting from each outcome of the 
DG with different values of A. We can describe the order 
of preference for all values of A lying between and 1 by 
using the values of ai,6i and ci and the functions shown in 
FIGURE[T1 and, then, we can obtain the equilibria for different 
social coefficients. For instance, the preference ordering of 
player 1 for < A < ai is: 



iD,C) > (C,C) > {D,D) > iCD) 



(4) 



Since the DG is a symmetric game, similar inequalities 
also hold for player 2, with the social coefficient being 7 
instead of A, and each outcome replaced with its mirror point. 
For instance, while (D,C) is shown to be the most preferable 
outcome for player 1 in FIGURE [T] (C, D) would take its 
place for player 2. The pair (A, 7) is a point of unit square 
[0,1] X [0,1]. 

The equilibria for different social coefficients in the case of 
fli < hi are given in FIGURE |2] for the generic sub-rectangles 
and are also presented in TABLE IVIIII which includes the 
boundary points of these 9 regions. Note that on any boundary 
point of 2 or 4 generic regions, the set of equilibria is precisely 
the union of equilibria in the neighbouring generic regions. 



ai < bi < ci 
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Fig. 1: Change of payoffs {T > R > Mi > M2 > P> M[ 
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Fig. 2: The set of equiHbria for each of the 9 generic regions 

(ai < 61, a2 < 62) 



2) Case: bi < ai and 62 < 02." FIGURE [3] illustrates the 
change in payoffs resulting from each outcome of the DG 
with different values of A in the case of 61 < ai. FIGURE]?] 
provides the set of equilibria for each of the 9 generic regions 
and TABLE ]IX] presents the equilibria for all possible social 
coefficients. 

D. Double game with incomplete information 

We now assume that the players do not know each other's 
social coefficients prior to any game, which means that they 
do not know the full values of the payoff matrix. Thus, the 
game has two-sided incomplete information. We assume that 
the social coefficient of each player has a finite number of 
possible values and that the probability distribution of the 
social coefficient is common knowledge between the two 
players. We also assume that, at the start of the game, each 
player is aware of the value of their own social coefficient 
(private information) or type, but not the value of the social 
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TABLE VIII: The equilibria for different social coefficients 

for fli < 61 and a2 < 62 
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Fig. 3: Change of payoffs (The variation of payoffs (T > R > 

Ml > M2 > P> M'l = = S) 



coefficient of the opponent. In that way, each player relies on a 
probabilistic inference to predict the opponent's actions. From 
TABLES IVIIII and IIXI we see immediately that for extreme 
types A,7 = 0,1, we have the pure NE {D,D), {C,D), 
{D, C) and (C, C) and it follows immediately that in both 
cases of oi < 61, 02 < 62, and bi < ai, 62 < 0,2, we have 
pure regular games. 

We present two specific examples with finite sets of types 
for the two players, one of which is actually the basis for the 
round-robin tournament in the next section as well. Assume 

T > R> M > P > M' = S and ai < 61, 02 < 62 
1 ) Example I: We choose the four discrete values, or types. 



Ai = 0, A2 = ai, A3 = 61, A4 = 1 

71 = 0, 72 = 02, 73 = b2, 74 = 1 

Note that A2 , 72 , A3 , 73 give the values at the boundaries of 
the three generic regions in the unit square with each other. 
Table ]X] gives the set of Nash equilibria for all possible pairs 

(Am,7„). 

From this table, we see that for pairs of types where there 
is a choice of pure NE, we can choose a pure NE such that 
we obtain TABLE IXll 

From the above, we see that the DG is completely pure 
regular with {DDCC, DDCC) as a pure Bayesian NE. 
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Fig. 4: The set of equilibria for each of the 9 generic regions 

(hi < ai, b2 < a2) 



a2 < -1 < 1 


(D,C) 


(D,C) 


(CO 


(C,C) 


(C.C) 


-y = a2 


(D,C) 


(D,C),(D,D),(C,C) 


(D,D),(C,C) 


(D,D),(C,C) 


(C,C) 


^2 < -V < 


(D,D) 


(D.D),(C.CI 


(D.DJ,(C,C) 


(D,DJ,(C,C} 


(C.C) 


7 = b2 


(D.D) 


(D,D),(C,C) 


(D.D),(C,C) 


(C.D).(D.D).(C.C) 


(C,D) 


< 7 < fa2 


(D,D) 


(D.DI 


(D.D) 


(C,D) 


(C,D) 




< A < bi 


X = bi 


bi < X < ai 


A = ai 


ai < \ < 1 



TABLE IX: The equilibria for different social coefficients for 

bi < fli and 62 < 02 



2) Example II: We take 5 discrete values, or types for each 
player as follows, 

Ai = 0, A2 = ai, A3 = ^ ^ \ A4 = fei, A5 = 1 
a2 + b2 

71 = 0, 72 = 0,2, 73 = ^ — , 74 02, 75 1 

Table IXIII gives the set of Nash equilibria for all possible 
pairs (Ai,7i). 

From this table, we can see that the DG is not completely 
pure regular We will use this DG in the next section for the 
repeated PD game. 

IV. Round Robin Tournament Analysis 

A computer tournament of the DG has been implemented to 
operate as a framework, where we could test the validity of the 
theoretical results, by comparing the performance of various 
competing strategies. The structure of the tournament is round- 
robin, and, thus, all the strategies compete in the iterated 
double game against all the other strategies and themselves 
once. Each game between any two strategies consists of 200 
rounds and the total score of a strategy in a game is the 
sum of the payoffs acquired from all the rounds. The average 
score of a strategy accounts for its robustness and stability, 
since the amount of strategies ensures the existence of a rather 
competitive environment. The numerical values we use for the 
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TABLE X: The equilibria for different social coefficients for 
Oi < 61 and a2 < ^2 with four types per player 
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TABLE XI: The equilibria chosen for different social coeffi- 
cients from TABLE |X] 



payoff values are the following: 

T 5, i? 3, Ml = A/2 = 2.5, P = 1, M[ ^ ^ S = 

These values are chosen precisely to create an environment, 
where a real dilemma is present. Given the aforementioned 
values for the payoffs, a strategy can score between and 
1000 in a single game. 

A score of can be obtained by either a strategy that has 
a social coefficient equal to and cooperates throughout the 
whole game, while the opponent only defects, or a strategy 
that has a social coefficient equal to 1 and defects throughout 
the whole game, irrespective of what its opponent does. 

On the other hand, a score of 1000 can only be obtained 
by a strategy that has a social coefficient equal to and 
defects throughout the whole game, while the opponent only 
cooperates. 

A score of 200 can be obtained by two strategies that have 
social coefficients equal to and mutually defect throughout 
the whole game. 

A score of 500 can be obtained by a strategy that has a 
social coefficient equal to 1 and cooperates throughout the 
whole game, irrespective of what the opponent does. 

Finally, a score of 600 can be obtained by two strategies 
that have social coefficients equal to and mutually cooperate 
throughout the whole game. 

The scores for a single game described here give us an idea 
of what to expect from the games, and, as we will see, the 
winner of the tournament has an average score of higher than 
600. 

A significant aspect of the tournament is the selection of a 
social coefficient by a strategy. As mentioned, we allow the 
social coefficient to be part of the discrete set of five distinct 
values, that we used in the theoretical analysis as in example 
II, 
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TABLE XII: The equilibria for different social coefficients 
for ai <hi, a2 < ^2 with five types per player. 



The strategies are allowed to change their social coefficients 
within the game and adapt them to the environment they face. 
This is done due to the fact that the repeated DG for the PD is 
assumed to reflect a range of situations, spanned over a finite 
interval of time, represented by the number of rounds of the 
game. As a result, since human beings can change their social 
values, according to the experiences that they acquire, it seems 
realistic to have a varying social coefficient for the repeated 
version of the DG for the PD. 

However, since it is difficult to quantify exactly by how 
much human beings change their social values, we only allow 
the strategies to change their social coefficients stepwise, and, 
as a result, they can either increase them or decrease them by 
one value at any round. This is done to avoid having strategies 
changing their social coefficients from a value of to that of 1 
in a single round, since, it is beUeved that, only under extreme 
and unprecedented circumstances would such a sudden change 
occur in one's social values. 

The strategies participating in the tournament vary in ways 
such as their algorithmic complexity, the choice they make for 
the first round of the game and their initial social coefficient. 
Some of them take into account the decisions that their oppo- 
nent has made up to the point of consideration in the game, 
some use probabihstic estimations and even randomness in 
making their decisions, some have already made up their mind 
and follow rules that do not change according to the flow of the 
game, some make use of Bayesian inference and others use the 
(A, 7) diagrams to make their decisions according to the Nash 
equilibria indications. In essence, a strategy consists of an 
algorithm and so it operates according to certain instructions, 
changes the social coefficient and provides the decision of 
whether to cooperate or defect. 

The initial social coefficient of a strategy shows its inten- 
tions, since a low social coefficient usually implies proneness 
to defection, while a high social coefficient on the other 
hand cooperation. Varying initial social coefficients across 
tournaments mean changing initial conditions, and, as a result, 
dynamic environments. Certain strategies have complex ways 
for deahng with their opponents' initial behaviour, and, so, 
what they may infer from it, may, in some cases, pre-determine 
the rest of the course of the game. In addition, most strategies 
have algorithms that modify their social coefficient in almost 
every round, thus, enabling them to achieve in the course of the 
game their optimal social coefficient for a particular game and 
adapt to the environment that has been developed from their 



opponents' actions. Then, they can respond effectively to both 
cooperative and defective behaviours and not be restricted by 
their choice of initial social coefficient. 

As mentioned in the theoretical part of the analysis of the 
DG, different kinds of behaviour are observed when A and 7 
change. 

With < A < a and < 7 < o, the Nash equihbrium is 
provided by (D, D), i.e. mutual defection. So, for such social 
coefficients, we expect to see defective behaviours as strategies 
try to recognise their opponents' intentions and see whether 
they can get away with defection, or they will face retahatory 
behaviour. 

With a < A < 6 and a < 7 < 6, the Nash equilibria are 
provided by (C, D) and (D, C), i.e. a game of brinkmanship. 
So, for such social coefficients, the player who defects first 
has an advantage and dominates by gaining from an oppo- 
nent's cooperation. However, with the increase of the social 
coefficient, cooperating can be beneficial, since a player can 
gain the reward from the SG. In that way, a lot of strategies 
change their behaviour at this stage and employ a cooperative 
approach to the game. 

With 6 < A < 1 and 6 < 7 < 1, the Nash equihbrium 
is provided by (C, C), i.e. mutual cooperation. So, for such 
social coefficients, strategies with cooperative behaviour can 
gain the social rewards and not suffer the social punishments 
from the SG. The various stages of the social coefficients' 
changes that alter the form of the DG are summarised below. 

. 0<A<a, 0<7<a: 
Nash equihbria: (D, D) 
PD 

• a<A<6, a<7<6: 

Nash equihbria: (C, D), (D, C) 
Chicken Game / Game of Brinkmanship 

• b<\<l, 6<7<1: 
Nash equihbria (C, C) 
Cooperation Game 

V. SEG 

The results of the tournament showed a clear winning strategy, 
whose average and cumulative scores were much higher than 
those of any other participating strategy. Its algorithm is 
a mixture of the results of the theoretical work and some 
conditions on how to alter its social coefficient, so as to adapt 
to the course of action of any game. This strategy was devised 
to take the maximum benefits of a game, irrespective of the 
nature of the opposing strategies. 

We call it SEG and explain its functionality as follows. SEG 
is based on two parts; deciding whether to cooperate or defect 
and altering its social coefficient based on some pre-defined 
conditions. For the former, it looks up the (A, 7) diagram and 
behaves as the Nash equilibria indicate, thus, its decision of 
whether to cooperate or defect depends only on the theoretical 
work and the results drawn from it. For the latter, it changes 
its social coefficient according to the following conditions: 



• If SEG chose C and its opponent chose C in the previous 
round, it does not change its social coefficient. 

• If SEG chose C and its opponent chose D in the previous 
round, it increases its social coefficient. 

• If SEG chose D and its opponent chose C in the previous 
round, it decreases its social coefficient. 

• If SEG chose D and its opponent chose D in the previous 
round, it increases its social coefficient. 

It should be also mentioned that its initial social coefficient 
is 0, the result of which is that in the first round it defects, 
since this is indicated by the Nash equilibria in the (A, 7) 
diagram. 

Its success can be attributed to several reasons. It works 
on the principle of adjusting its social coefficient based not 
only on its opponent's behaviour, but its own as well. If its 
opponent defected and it either cooperated or defected in the 
previous round, it increases its social coefficient to avoid the 
disastrous cycles of mutual defection, that would be caused by 
a low social coefficient, since that would be the indication from 
the Nash equilibria in the (A, 7) diagram. In that way, SEG, 
instead of trying to alter its opponent's behaviour by retaliating 
to its defections, realises that it is better off by increasing its 
social coefficient to get the social rewards from cooperation. If 
its opponent cooperated and it defected in the previous round, 
it decreases its social coefficient, as it sees that the opponent 
does not retaliate to its defections, so it exploits the situation 
and behaves as instructed from the Nash equilibria in the (A, 7) 
diagram, so as to get the temptation's payoff to defect. In a 
sense it exploits the opponent in that particular case, however, 
it is the opponent who allows that behaviour by not retaliating. 
In such cases, some strategies allow themselves to be taken 
advantage of, because, if they have a high social coefficient, 
they can get the social rewards from cooperation and not 
concern themselves about their opponents' actions. Finally, if 
both its opponent and it cooperated in the previous round, 
it keeps its social coefficient, so as to make this cooperation 
stable and sustainable. 

Although SEG seems a strategy that promotes defective 
behaviour, that is not the case, as with all the strategies 
that are cooperative and do not let themselves to be taken 
advantage of, cooperation emerges quickly. Also, in its choice 
of whether to cooperate or defect, it behaves as specified 
by the Nash equilibria in the (A, 7) diagram, so it does 
not behave oddly and unrealistically, such as defecting with 
a high social coefficient or cooperating with a low social 
coefficient. It should be also noted that in the points, where 
there exist multiple Nash equilibria, and there is a choice 
between cooperating and defecting, we follow our theoretical 
analysis and select defection, as this potentially maximises the 
payoff, given the particular payoff values. 

Another fact that shows the success of SEG is that we 
conducted two separate tournaments, where we changed the 
values of M, the social rewards payoff for cooperation, to 3 
and 2, respectively, to see whether the value of 2.5, which was 
our initial selection, biased the results. The result was that, in 
both cases, SEG was the winning strategy of the tournament. 



The rankings, the total scores of the 10 top-scoring strate- 
gies, the initial social coefficients and their decisions of 
whether to cooperate or to defect in the first round (when 
the choice is specified by the Nash equilibria of the (A, 7) 
diagram, we write NE) are presented in TABLE IXIIin 



Ranking 


Strategy 


Total Score 


Initial Social Coeff. 


Initial Choice 


1 


SEG 


121536.49 


0.00 


NE 


2 


ANE 


114694.48 


0.00 


NE 


3 


BNE 


113751.58 


0.00 


NE 


4 


ADAPTIVE 


112387.83 


0.00 


C/D 


5 


TQC 


109570.71 


0.29 


D 


6 


MTFT 


108649.00 


0.00 


D 


7 


MIXED 


103372.31 


1.00 


C 


8 


SD 


97643.53 


0.44 


C/D 


9 


DTFT 


97330.00 


0.00 


C 


10 


TFT 


96805.00 


0.00 


C 



TABLE XIII: Top 10 Scoring Strategies, Total Scores, Initial 
Social Coefficients and Initial Choices 



The rankings and the average scores of the 10 top-scoring 
strategies are presented in table 



Ranking 


Strategy 


Average Score 


1 


SEG 


633.00 


2 


ANE 


597.37 


3 


BNE 


592.46 


4 


ADAPTIVE 


585.35 


5 


TQC 


570.68 


6 


MTFT 


565.88 


7 


MIXED 


538.40 


g 


SD 


508.56 


9 


DTFT 


506.93 


10 


TFT 


504.19 



TABLE XIV: Top 10 Scoring Strategies, Average Scores 

Let us now isolate some of the games of SEG and examine 
them, so that its strengths in getting the maximum from a 
game, will become apparent. 

. SEG vs. ALLD: 

ALLD is a strategy that has an initial social coefficient 
equal to 0, it constantly chooses D and never changes 
its social coefficient. 

The game decisions between the two are as follows: 

SEG: D D D C D C C C C C...C...C 
ALLD: D D D D D D D D D D...D...D 

As we can see, since SEG initially has a social coefficient 
equal to 0, it chooses what the Nash equilibria of the 

' Please note that the algorithms of the strategies participating in the round- 
robin tournament are not presented here due to space constraints. 
-Same as 1. 



(A, 7) diagram indicate, which, for such a social 
coefficient, is D. However, although other strategies 
would keep choosing D against ALLD, and thus, get 
hurt from mutual defection, the algorithm of SEG 
makes it increase its social coefficient, and, from the 
seventh round and onwards, it has a very high social 
coefficient, and, thus, chooses C for the rest of the game. 
It gets the maximum possible of points from a clearly 
uncooperative strategy like ALLD. 

. SEG vs. ALLC: 

ALLC is a strategy that has an initial social coefficient 
equal to 1, it constantly chooses C and never changes its 
social coefficient. 

The game decisions between the two are as follows: 

SEG: D D D D D D D D D D...D...D 
ALLC: CCCCCC C CC C...C...C 

As we can see, since SEG initially has a social coefficient 
equal to 0, it chooses what the Nash equilibria of the 
(A, 7) diagram indicate, which, for such a social 
coefficient, is D. Then, it reaUzes that ALLC will not 
retaliate to its defections and keeps choosing D, so 
that it can constantly acquire the temptation to defect 
payoff. In some sense, it exploits the unwillingness 
of ALLC to retaliate, however, ALLC gets the social 
reward for cooperation, so it is not entirely taken 
advantage of, since a high social coefficient allows a 
strategy to be indifferent towards the opponent's actions. 
Again, SEG gets the maximum possible of points from 
a cooperative, that does not retaliate, strategy like ALLC. 

. SEG vs. TFT: 

TFT is a strategy that has an initial social coefficient 
equal to 1, it initially chooses C, and for the rest of the 
game chooses what its opponent chose in the last round. 
It does not change its social coefficient. 
The game decisions between the two are as follows: 

SEG: D D D D C C C C C C...C...C 
TFT: C D D D D C C C C C...C...C 

As we can see, since SEG initially has a social coefficient 
equal to 0, it chooses what the Nash equihbria of the 
(A, 7) diagram indicate, which, for such a social coeffi- 
cient, is D. TFT initially chooses C, but then retaliates 
to the defection of SEG and chooses D. Thus, although 
after the first round, SEG decreased its social coefficient, 
after the second round and the retaliatory behaviour of 
TFT, it increases its social coefficient, so as to avoid 
the destructive consequences of mutual defection. Then, 
after the fifth round, it has increased its social coefficient 
enough, so that the Nash equilibria indicate C as the 



choice for the next round. TFT reciprocates both coop- 
eration and defection, and, thus, chooses C in the next 
round, as well. Therefore, for the remaining rounds, both 
strategies choose C, and, cooperation based on reciprocity 
has emerged. Again, SEG gets the maximum possible 
points from an opponent that is cooperative, but protects 
itself, and retaliates to defection, as well. 
The conclusion to be drawn is that SEG seems to be a 
strategy that performs well under numerous environments and 
it shows signs of robustness and stability. It successfully puts 
the theoretical results into practice, since it uses the Nash 
equilibria, single or multiple, of the (A, 7) diagrams to make its 
decisions. It uses its social coefficient to promote cooperation 
based on reciprocity, to protect itself from purely defective 
strategies, and to exploit situations, where it realises that there 
is no willingness for retaliation from the opponent. Finally, 
although it starts by choosing D and by having a low initial 
social coefficient, it can quickly change its behaviour, should 
it see signs for cooperation based on reciprocity from the 
opponent's side. 

VI. Conclusion 

In this paper, we have introduced Multi-Games (MG) which 
can be used when a number of players need to divide up their 
resources according to different weights to a given number of 
games which are then played out simultaneously. For the case 
of a two-player Double Game (DG) in which each player has 
the same set of strategies for the two basic games, we have 
shown that when the possible weights for each player belong 
to a finite set of types, it is possible to decide in linear time if 
there is a set of local pure NE for all pairs of types, for which 
the strategy of each player only depends on its own type. In 
such a case we inmiediately obtain a Bayesian pure NE for 
the DG. 

This result, in future work, will be extended to mixed NE 
on the one hand and to any N-player MG on the other. 
A challenging question is if we can reduce the complexity 
of computing a Bayesian NE for a pure regular (but not 
completely pure regular) MG. 

We also extended the classical PD with a Social Game (SG) 
to obtain a DG which can model the prosocial behaviour of 
human agents. We showed that two types of DG are possible 
with some reasonable assumptions about the SG. For one of 
these two possibilities, we provided two examples for DG 
with finite sets of types for the players, of which the first 
was completely pure regular, with a pure Bayesian NE which 
was directly determined, while the second was not. 

Finally, we conducted a tournament consisting of various 
strategies for the repeated DG of the second example above. 
This provided a testing framework for our work, and we 
devised a winning strategy. In this repeated version of the 
DG for future work, we could keep, in each round of a game, 
separate scores for the material and social components of each 
strategy. Then at the end of the final game, we can take a 
convex combination of the two total scores by applying a 
global weight, which would be determined by society as a 



whole. This we believe would add to the degree of realism 
that the DG aims to achieve. 
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